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Abstract. The Kantorovich function (x T Ax){x T A~ l x) , where A is a positive definite ma- 
trix, is not convex in general. From matrix/convex analysis point of view, it is interesting to 
address the question: When is this function convex? In this paper, we investigate the convex- 
ity of this function by the condition number of its matrix. In 2-dimensional space, we prove 
that the Kantorovich function is convex if and only if the condition number of its matrix is 
bounded above by 3 + 2y2, and thus the convexity of the function with two variables can be 
completely characterized by the condition number. The upper bound l 3+2^/2 , is turned out to 
be a necessary condition for the convexity of Kantorovich functions in any finite-dimensional 
spaces. We also point out that when the condition number of the matrix (which can be any 



dimensional) is less than or equal to y 5 + 2\/6, the Kantorovich function is convex. Further- 
more, we prove that this general sufficient convexity condition can be remarkably improved in 
3-dimensional space. Our analysis shows that the convexity of the function is closely related to 
some modern optimization topics such as the semi-infinite linear matrix inequality or 'robust 
positive semi-definiteness' of symmetric matrices. In fact, our main result for 3-dimensional 
cases has been proved by finding an explicit solution range to some semi-infinite linear matrix 
inequalities. 
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1 Introduction 



Denote by 

K(x) = (x T Ax)(x T A- 1 x) 

where x £ R n and A is a given n x n symmetric, positive definite real matrix with eigenvalues 
< Ai < A2 < • • • < A n . Then, we have the following Kantorovich inequality (see e.g. 
[17, 21, 22, 24, 31]): 

|M|4 > 4AlA " (x T Axx T A- 1 x) for any x G R n . 

This inequality and its variants have many applications in matrix analysis, statistics, numerical 
algebra, and optimization (see e.g. [7, 11, 15, 16, 22, 23, 25, 26, 30, 32, 33, 34]). In this paper, 
K(x) is referred to as the 'Kantorovich function'. While K(x) has been widely studied and 
generalized to different forms in the literature, from matrix/convex analysis point of view some 
fundamental questions associated with this function remain open or are not fully addressed. 
For instance, when is this function convex? Is it possible to characterize its convexity by the 
condition number of its matrix? 

Let us first take a look at a simple example: Let A be a 2 x 2 diagonal matrix with diagonal 
entries 1 and 6. Then A^ 1 is diagonal with diagonal entries 1 and 1/6, and it is easy to verify 
that in this case the function K(x) is not convex (since its Hessian matrix is not positive 
semi-definite at (1,1)). Thus, the Kantorovich function is not convex in general. The aim of 
this paper is trying to address the above-mentioned questions, and to develop some sufficient 
and/or necessary convexity conditions for this function. 

We will prove that the Kantorovich function in 2-dimensional space is convex if and only if 
the condition number of its matrix is less than or equal to 3 + 2^/2. Therefore, the convexity of 
this function can be characterized completely by the condition number of its matrix, and thus 
the aforementioned questions are affirmatively and fully answered in 2-dimensional space. For 
higher dimensional cases, we prove that the upper bound '3 + 2a/2' of the condition number 
is a necessary condition for the convexity of K{x). In another word, if K{x) is convex, the 
condition number of A must not exceed this constant. On the other hand, we show that if 
the condition number of the matrix is less than or equal to then K(x) must be 

convex. An immediate question is how tight this bound of the condition number is. Can such 
a general sufficient condition be improved? A remarkable progress in this direction can be 
achieved at least for Kantorovich functions in 3-dimensional space, for which we prove that 
the bound \J 5 + 2\/~6 can be improved to 2 + \/3. The proof of such a result is far more than 
straightforward. It is worth mentioning that we do not know at present whether the result in 
2-dimensional space remains valid in higher dimensional spaces. That is, it is not clear whether 
or not there is a constant 7 such that the following result holds: K(x) is convex if and only 
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if the condition number is less than or equal to 7. By our sufficient, and necessary conditions, 
we may conclude that if such a constant 7 exists, then \J 5 + 2\/6 < 7 < 3 + 2^2. 

The investigation of this paper not only yields some interesting results and new under- 
standing for the function K(x), but raises some new challenging questions and links to certain 
topics of modern optimization and matrix analysis as well. First, the analysis of this paper 
indicates that the convexity issue of K(x) is directly related to the so-called (semi-infinite) 
linear matrix inequality problem which is one of the central topics of modern convex optimiza- 
tion and has found broad applications in control theory, continuous and discrete optimization, 
geometric distance problems, and so on (see e.g. [8, 10, 11, 14, 18, 28]). In fact, the convexity 
condition of K(x) can be formulated as a semi-infinite linear matrix inequality problem. In 
3-dimensional space, we will show that how such a semi-infinite matrix inequality is explicitly 
solved in order to develop a convexity condition for K{x). 

The convexity issue of K(x) can be also viewed as the so-called 'robust positive semi- 
definiteness'' of certain symmetric matrices, arising naturally from the analysis to the Hessian 
matrix of K(x). The typical robust problem on the positive semi-definiteness can be stated as 
follows: Let the entries of a matrix be multi-variable functions, and some of the variables can 
take any values in some intervals. The question is what range of values the other variables 
should take such that the matrix is positive semi-definite. Clearly, this question is also referred 
to as a robust optimization problem or robust feasibility/stability problem [2, 3, 4, 5, 6, 12, 
13, 27]. 'Robust positive semi-definiteness' of a matrix may be stated in different versions. For 
instance, suppose that the entries of the matrix are uncertain, or cannot be given precisely, 
but the ranges (e.g. intervals) of the possible values of entries are known. Does the matrix 
remain positive semi-definite when its entries vary in these ranges? Our analysis shows that 
the study of the convexity of K{x) is closely related to these topics. 

Finally, the convexity issue of K{x) may stimulate the study of more general functions than 
K(x). Denote by q A (x) = ^x T Ax. Notice that q A (x) = ^x T A~ l x = (7,4-1(2;) is the Legendre- 
Fenchel transform of q A (x) (see e.g. [1, 9, 20, 29]). The Kantorovich function can be rewritten 
as 

K(x) = 4q A (x)q* A (x) = 4q A (x)q A -i(x). 

Thus, K(x) can be viewed as the product of the quadratic form q A (x) and its Legendre-Fenchel 
transform q* A {x), and can be viewed also as a special case of the product of quadratic forms. 
Thus, one of the generalization of K{x) is the product K(x) = h(x)h*(x), where h is a convex 
function and h* (x) is the Legendre-Fenchel transform of h. The product of convex functions has 
been exploited in the field of global optimization under the name of multiplicative programming 
problems. However, to our knowledge, the function like K(x) has not been discussed in the 
literature. It is worth mentioning that the recent study for the product of univariate convex 
functions and the product of quadratic forms can be found in [19, 35], respectively. 

This paper is organized as follows. In Section 2, we establish some general sufficient, 



3 



necessary conditions for the convexity of K(x), and point out that the convexity issue of 
K{x) can be reformulated as a semi-infinite linear matrix inequality or robust positive semi- 
definiteness of matrices. In Section 3, we prove that the convexity of K{x) in 2-dimensional 
space can be completely characterized by the condition number of its matrix. In the Section 
4, we prove an improved sufficient convexity condition for the function K(x) in 3-dimensional 
space by finding an explicit solution range to a class of semi-infinite linear matrix inequalities. 
Conclusions are given in the last section. 

Notation: Throughout this paper, we use A >- (V 0) to denote the positive definite 
(positive semi-definite) matrix. k(A) denotes the condition number of A, i.e., the ratio of its 
largest and smallest eigenvalues: n(A) = A max (^4)/A m i n (j4). qA(x) denotes the quadratic form 
(l/2)x T Ax. 

2 Sufficient, necessary conditions for the convexity of K(x) 

First of all, we note that a sufficient convexity condition for K(x) can be obtained by Theorem 
3.2 in [35] which claims that the following result holds for the product of any two positive 
definite quadratic forms: Let A,B be two nx n matrices and A,ByO. If k(B~ 1 / 2 AB~ 1 / 2 ) < 
5 + 2\/6, then the function (j^x T Ax) (\x T Bx^j is convex. By setting B = A 1 and noting 
that in this case k{B~ x I 2 AB~ X I 2 ) = k(A 2 ), we have the following result. 

Theorem 2.1. Let A be any n x n matrix and ^4^0. If n(A) < \Jf> + 2\/6, then the 
Kantorovich function K{x) is convex. 

Thus, an immediate question arises: What is a necessary condition for the convexity of 
K{x)l This question is answered by the next result of this section. 
Let A be an n x n positive definite matrix. Denote by 

f(x) := q A (x)q A -i(x) = Qx T Ax^j Q^^) = \k{x). (1) 

Clearly, the convexity of K{x) is exactly the same as that of f{x). Since / is twice continuously 
differentiable in R n , the function / is convex if and only if its Hessian matrix V 2 / is positive 
semi-definite at any point in R n . It is easy to verify that the Hessian matrix of / is given by 

V 2 f{x) = q A (x)A- 1 + q A -i (x)A + Axx T A~ l + A^ 1 xx T A. (2) 

Since A is positive definite, there exists an orthogonal matrix U (i.e., U U = I) such that 

A = U T AU, A' 1 = U t A~ 1t I, (3) 

where A is a diagonal matrix whose diagonal entries are eigenvalues of A and arranged in 
non-decreasing order, i.e., 

A = diag (Ai, A 2 , A n ), < Ai < A 2 < • • • < A n . 
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By setting y = Ux, we have 



q A (x) = l -x T {U T KU)x = q A (y), q A -i(x) = ^x T {U T K^x = q A -,(y). 



Notice that 



1 1 n 1 1 n 1 



i=l 



i=l 



Thus, we have 



q A (y)A 1 + q A -i(y)A 

/ IV ( Ai + Al ^1 y2 

2 Z^i=l I Ai A, / "i 



I V™ f Ai _l_ As. ^\ 



2/i + 5 Er=i,^2 a 2^i 



2/n + 2 Ei=l,i/n ^niUi J 



where 



It is evident that 



A ii = ^- + ^-, i,j = l,...,n, i^j. 



Ajj = A,j and Ay > 2 for any i / j. 



(4) 
(5) 
(6) 



Thus, throughout the paper we do not distinguish Ajj and Ajj, and we denote by A = (Ajj), 
the Q(n — l)n) -dimensional vector whose components are Ay's where 1 < i < j < n. On the 
other hand, it is easy to verify that 



Ta-i 



( 2y\ A 12 yiy 2 ■■■ A ln yiy n \ 



(7) 



\ A n iy n yi A n2 y„y 2 ■•• 2y2 J 
where Ay is given by (5). Therefore, by (2), (3), (4) and (7), we have 

V 2 f(x) = U T (qA(y)k- 1 + q A -i (y)k + kyy T k' 1 + k'W k) U 
= U T H n (A,y)U, 



(8) 
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where 



H n (A,y) 

' 3yj + £™ =2 k A ^Vi A 12 yiy 2 • • • Ai„yiy„ 

A 2 iy 2 yi 3y| + £™ =W2 gA^y? • • • A 2 „y 2 y rl 

A nl y nyi A n2 y n y 2 ■■■ ^yl + T,7=i,i^nh A niVi . 

(9) 

Form (8), we see that V 2 /(x) >; for any x G i? n if and only if the vector A = (A^-) satisfies 
that 

tf n (A,y)^0 foranyyGi?" (10) 

Hence, f(x) is convex (i.e., is convex) if and only if (10) holds. The following observation 

is useful for the proof of Theorem 2.3 of this section. 

Lemma 2.2. Let < 7 < 5 be two constants. For any t±,t2 G [l,S], we have 

U to 7 5 
2<^ + f<j + -. 
t 2 t\ 07 

Proof. Let g(v) = v + where f = ■ Since ti, <2 G [7, <S], it is evident that v G in 
which g(z/) is convex. Clearly, the minimum value of ^(^) attains at v = 1, i.e, 5(1") > 2. The 
maximum value of <?(f) attains at one of endpoints of the interval [J, thus g(v) < j + ^, 
as desired. □ 

Noting that < Ai < ■ ■ ■ < A n , by Lemma 2.1 and the definition of Ay, we have the 
following relation: Ajj < A^^, for any k\ < i,j < k 2 and i 7^ j, where 1 < fci,&;2 < n. For 
instance, we have that Ajj < Ai„ for any z / j. We now prove a necessary convexity condition 
which is tight for general K{x). 

Theorem 2.3. Let A be any n x n matrix and A >- 0. // i-T(x) is convex, then k(A) < 
3 + 2^/2. 

Proof. Assume that -K"(cc) is convex and thus f(x), given as (1), is convex. It follows from 
(8) and (9) that H n (A,y) >z for any y G R n . Particularly, H n (A,y) >z for y = e, + e^, 
where i 7^ j and e^, are the ith and jth columns of the n x n identity matrix, respectively. 
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Notice that 



3 + ±A 



Ki 



A 



31 



3 + ±A« 



V 



where all stared entries are positive numbers. Thus, H n (A, y) >z implies that its principle 
submatrix 



3 + iAij 
A,- 



A,; 



-i ,, 3 + r^Aji J 

is positive semi-definite. Since the diagonal entries of this submatrix are positive, the submatrix 
is positive semi-definite if and only if its determinant is nonnegative, i.e., 



3 + 



3 + 



AjjAj, > 0, 



jAjj > 0. Thus, Aij < 6. Since e% and ej can be 



which, by (6), can be written as 9 + 3Ajj 
any columns of the identity matrix, the inequality Ay < 6 holds for any i,j = l,...,n and 
i 7^ j. Lemma 2.1 implies that Ai„ = maxi<i t j< n ^j \j, and hence 

1 



k(A) 



k(A) 



— + — = Ai n 

Ai A n 



max A^ < 



which is equivalent to k(A) < 3 + 2y/2. □ 

Notice that for each fixed y, H n (A, y) y is a linear matrix inequality (LMI) in A. Since y 
is any vector in R n , the system (10) is actually a semi-infinite system of LMIs. Recall that for a 
given function g(u, v) : R p xR q — > R, the semi-infinite inequality (in u) is defined as g(u, v) > 
for any v G 5, where S 1 C i? 9 is a set containing infinite many points. Such inequalities have 
been widely used in robust control [8, 14] and so-called semi-infinite programming problems 
[18], and they can be also interpreted as robust feasibility or stability problems when the value 
of v is uncertain or cannot be given precisely, in which case S means all the possible values of v. 
Recently, robust problems have wide applications in such areas as mathematical programming, 
structure design, dynamic system, and financial optimization [2, 3, 4, 6, 12, 13, 27]. Since A 
should be in certain range such that H n (A,y) >z for any y £ R n , (10) can be also called 
'robust positive semi-definiteness' of the matrix H n . 

Any sufficient convexity condition of K(x) can provide some explicit solution range to the 
system (10). In fact, by Theorem 2.2, we immediately have the next result. 
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Corollary 2.4. Consider the semi-infinite linear matrix inequality (in A): 



A 2 i2/2j/i 



3y 2 2 



Ai 2 yiV2 



Ai n yiy n 

A 2n y 2 y n 



y o 



A n iy n yi A n2 y n y 2 ••• 3y2 + ^ lA m yf 

/or any y G i? n , where Aij's satisfy (6). Then the solution set A* of the above system is 
nonempty, and any vector A = (Ajj) where A^- 6 



2, J5 + 2^/6 



is m A* 



Conversely, any range of the feasible solution A to the semi-infinite linear matrix inequality 
(10) can provide a sufficient condition for the convexity of K(x). This idea is used to prove 
an improved sufficient convexity condition for K(x) in 3-dimensional space (see Section 4 for 
details). 

Combining Theorems 2.2 and 2.3 leads to the following corollary. 

Corollary 2.5. Assume that there exists a constant, denoted by 7* , such that the following 
statement is true: K(x) is convex if and only if n(A) < 7*. Then such a constant must satisfy 



that \jf> + 2y/Q < 7* < 3 + 2>/2. 



However, the question is: Does such a constant exist? If the answer is 'yes', we obtain a 
complete characterization of the convexity of K(x) by merely the condition number. In the 
next section, we prove that this question can be fully addressed for 2-dimensional Kantorovich 
functions, to which the constant is given by 7* = 3 + 2^/2 ( as a result, the bound given in 
Theorem 2.3 is tight). For higher dimensional cases, the answer to this question is not clear at 



present. However, for 3-dimensional cases we can prove that the lower bound y 5 + 2\/6 can 
be significantly improved (see Section 4 for details). 

3 Convexity characterization for K{x) in 2-dimensional space 

We now consider the Kantorovich function with two variables and prove that the necessary 
condition in Theorem 2.3 is also sufficient for this case. 

Theorem 3.1. Let A be any 2x2 positive definite matrix. Then K(x) is convex if and 
only if k(A) < 3 + 2y/2. 



Proof. Notice that there exists an orthogonal matrix U (i.e., U T U = I) such that 



A = U 1 



Pi 

h 



n a- 1 = U 1 
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where Pi, fa are eigenvalues of A. By using the same notation of Section 2, and setting y = Ux 
and n = 2 in (9), we have 



H 2 (A,y) 



3yf + ±Ai 2 yl Ai2yi?/2 
A 2 iy 2 yi 3y| + iA 2 iy 2 



(11) 



For 2-dimensional cases, the vector A is reduced to the scalar A = Ai 2 . If f(x) given by (1) is 
convex in R n , it follows from (8) that i7 2 (A,y) y for any y G R 2 . In particular, it must be 
positive semi-definite at y = e = (1, 1), thus 



# 2 (A,e) 



A 2i 



3+ AAi 2 



Ai 2 
3 + ±A 21 



>- 0. 



Since A i2 = A 2i , it follows that detiI 2 (A,e) = (3 + ^A i2 ) 2 - Af 2 > 0, i.e. 



9 + 3A 12 - ^A 2 2 > 0. 



(12) 



Conversely, if (12) holds, we can prove that / is convex. Indeed, we see that the diagonal 
entries of i? 2 (A,y) are positive, and that 



detff 2 (A,y) 



1 



1 



3yf + ^A 12 y 2 z ) ( 3^ + ^A 12 y z ) - ^{ 2 y\y\ 



= 9y 2 y 2 + %A 12 (yi + y\) - -AWivl 



> 9y 2 y 2 + 3A 12 y 2 y 2 _ |A 2 2 y 2 y 2 
= (9 + 3A 12 - ^A 2 2 ) y\y\ > 0. 

The first inequality above follows from the fact yf + y 2 > 2y 2 y 2 an d the second inequality 
follows from (12). Thus, i7 2 (A,y) y for any y 6 i? 2 , which implies that / is convex. 

Therefore, f(x) is convex (i.e., K{x) is convex) if and only if (12) holds. Notice that the 
roots of the quadratic function 9 + 3t — |t 2 = are t\ = —2 and t 2 = 6. Notice that Ai 2 > 2 
(see (6)). We conclude that (12) holds if and only if Ai 2 < 6. By the definition of Ai 2 , we 
have 



A10 = 



fa , fa 



k{A) + 



1 



fa ' fa ' k(AY 

Thus the inequality Ai 2 < 6 can be written as k{A) 2 — 6k(A) + 1 < 0, which is equivalent to 
k(A) < 3 + 2V2. □ 



It is worth stressing that the above result can be also obtained by solving the semi-infinite 
linear matrix inequality (10). In fact, by Theorem 2.3, it suffices to prove that k(A) < 3 + 2^/2 
is sufficient for the convexity of K(x) in 2-dimensional space. Suppose that k(A) < 3 + 2y/2, 



9 



which is equivalent to A 12 = |j + |^ < 6. Thus, A = A 12 G [2, 6]. We now prove that K{x) is 
convex. Define 

v(A 12 ,y) := detfT 2 (A,y). 
Differentiating the function with respect to A12, we have 
dv(A 12 ,y) 



det 



8A l2 

Differentiating it again, we have 

d 2 v(A 12 , y ) 



\y\ ym 

A212/22/1 3y| + ±A 2 iy 2 



+ det 



3yf + iAi 2 y| Ai 2 yiy 2 

I7/ 2 
2 Vi 



2 l 

2/22/1 



<9 2 A 



= det 



12 



52/2 2/12/2 
2/22/1 2-2/1 



+ det 



52/2 2/12/2 

2/22/1 \y\ 



= --2/?2/ 2 2 < 0. 



Therefore, for any given y G i? 2 , the function t>(Ai2,y) is concave with respect to A12, and 
hence the minimum value of the function attains at one of the endpoints of the interval [2, 6], 
i.e., 

v(A 12 ,y) > min{f (2, y),v(6, y)}. (13) 

Notice that 

3yf + y\ 2 Vl y 2 
22/22/1 3y| + y\ 

where the matrix is diagonally dominant and its diagonal entries are nonnegative for any given 
y G R 2 , and thus it is positive semi-definite. This implies that v(2,y) > for any y G R 2 . 
Similarly, 

62/12/2 



v(2,y) =det 



v(6, y) = det 



32/ 2 + 32/| 
62/22/1 3y| + 3yj 



> 0, 



since the matrix is diagonally dominant the diagonal entries are nonnegative for any y G R 2 . 
Therefore, it follows from (13) that v(A\ 2 ,y) > for any A12 G [2,6] and y G R 2 . This implies 
that when A12 G [2, 6] the matrix i^2(A, y) >z for any y G R 2 , and hence K(x) is convex. 



4 An improved convexity condition for K(x) in 3-dimensional 
space 

In this section, we prove that for K(x) in 3-dimensional space, the upper bound of the condi- 
tion number given in Theorem 2.2 can be improved to < 2 + \/3. To prove this result, we try to 
find an explicit solution range to a class of semi-infinite linear matrix inequalities, which will 
immediately yield an improved sufficient convexity condition for K(x) in 3-dimensional space. 
First, we give some useful inequalities. 
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Lemma 4.1. If 2 < 61,62,63 < 4, then all the functions below are nonnegative: 

(14) 

(15) 

(16) 

(17) 
(18) 



XI 0*1: 


. $2: 


S 3 ) 


:= 663 + 6162 - 


\S38l > 0, 


X2(Sl: 


. $2- 


S3) 


:= 6<5i + 6362 - 


\&A > o, 




• h- 


S 3 ) 


:= 662 + 6163 - 


\S2Sl > o, 


X4(*l: 


^2: 


S3) 


:= Q63 + 6162 - 


\S3Sl > 0, 




$2: 


S3) 


:= n + 6^263- 


si - s\ - s\ > 0. 



Proof. For any given 61,63 G [2,4], we consider the quadratic function (in t): p(t) = 



663 + Sit - \63t 2 which is concave in t. Let t\ be the largest root of pit) = 0. Then 



where the inequality follows from the fact that ^ > |. It is easy to check that the least root 
of pit) = is non-positive, i.e., t\ < 0. Thus, the interval [2,4] C [ii,^] in which the quadratic 
function p(t) > 0. Since 2 < 62 < 4, we conclude that 

Xi(S 1 ,6 2 , 63) = pi6 2 ) = 6S3 + 5iS 2 - -S3S2 > 0. 

Thus (14) holds. All inequalities (15)-(17) can be proved by the same way, or simply by 
exchanging the role of <5i, 62 and £3 in (14). 

We now prove (18). It is easy to see that ip is concave with respect to its every variable. 
The minimum value of ip with respect to 61 attains at the boundary of the interval [2, 4]. Thus, 
we have 

WuhM > min{V>(2,5 2 , 63), ^(4,<5 2 ,<5 3 )} for any 6^62,63 G [2,4]. (19) 

Notice that ^i2, 82,83) and tpi4, 82,63) are concave with respect to 62 G [2,4]. Thus we have 
that 

Y>(2,<5 2 ,<5 3 ) > min{Y>(2, 2, «5 3 ), -0(2, 4, <5 3 )} for any S 2 , 63 G [2, 4], (20) 
^,62,63) > min{V(4, 2, <5 3 ), ^(4, 4, <5 3 )} for any 62,63 G [2,4]. (21) 

Similarly, i/>(2, 2, <5 3 ), ^(2, 4, <5 3 ), ^(4, 2, S3) and ?/>(4, 4, <5 3 ) are concave in <5 3 . Thus, 

Y>(2,2,<5 3 ) >min{V(2,2,2),^(2,2,4} = min{8, 4)} > for any S 3 G [2, 4], 

V>(2,4,5 3 ) > min{V(2,4,2),V(2,4,4} = min{4, 8)} > for any 63 G [2, 4], 

V>(4,2,5 3 ) > min{V(4,2,2),^(4,2,4} = min{4, 8)} > for any 63 G [2, 4], 

V>(4,4,5 3 ) > min{V'(4,4,2),V(4,4,4} = min{8,8)} > for any 63 G [2,4]. 
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Thus, combining (19)-(21) and the last four inequalities above yields (18). □ 



We now focus on developing explicit solution range to certain semi-infinite linear matrix 
inequalities which will be used later to establish an improved sufficient convexity condition for 
K(x) in 3-dimensional space. 

4.1 The solution range to semi-infinite linear matrix inequalities 

Consider the following 3x3 matrix whose entries are the functions in (lo, a, (3) : 



3 + \ujia 2 + wiOi oj 2 P 

<jj\a \w\ + 3a 2 + \uJ3fi 2 oj^afi 

0J2P w 3 a/3 \u 2 + \u?,a 2 + 3/3 2 



(22) 



where u> = (wi, UJ2, ^3) G R 3 , Wj G [2,6] for j = 1,2,3, and a,/3 G [—1, 1]. We are interested in 
finding the range of ui,u}2, and 0J3 in the interval [2,6] such that 

M(u,a,P) h for any a,p G [-1,1], 

which is a semi-infinite linear matrix inequality. To this end, we seek the condition for u, under 
which all the principle minors of M are nonnegative for any a, ft G [—1,1]. 

First of all, we see from (22) that all diagonal entries (which are the first order principle 
minors) of M are positive in the intervals considered. Secondly, since 00 j G [2, 6](j = 1,2,3), 
and a G [—1, 1], it is easy to see that 

3 + -wia 2 = -(6 + ujia 2 ) > -(oji + ojio 2 ) > u)i\a\, 

-wi + 3a = 2^ Wl + 6a ) > 2 Uia ) — Wl l a r 

Therefore, the second order principle submatrix of (22) 

3 + i^ia 2 + \uJ2P 2 w\a. 

ujia \uj\ + 3a 2 + \u03P 2 

is diagonally dominant for any ojj G [2, 6](j = 1,2,3) and a, (3 G [—1, 1]. Similarly, the second 
order principle submatrices 

3 + \uj x a 2 + ±w 2 /3 2 w 2 /3 

uj 2 P ~ \u 2 + iw 3 a 2 + 3/3 2 

iwi + 3a 2 + \ujiP 2 uj 3 a(3 

w 3 a/3 + ±w 3 a 2 + 3/3 2 

are also diagonally dominant for any tjj G [2, 6](j = 1, 2, 3) and a, ft G [—1,1]. Thus, all second 
order principle minors of M are nonnegative in the intervals considered. It is sufficient to find 
the range of u = (lo±,uj2, w 3 ) such that the third order principle minor is nonnegative, i.e., 



detM(u,a,P) > for any a,/3 G [—1,1]. 
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However, this is not straightforward. In order to find such a range, we calculate up to the sixth 
order partial derivative of det M(u), a, (3) with respect to a. The first order partial derivative 
is given as 



ddetM(u,a,/3) 
da 



= det Mi (u, a, f3) + det M 2 (w, a, f3) + det M 3 (u, a, /3), 



(23) 



where Mi(co,a, (3) is the matrix that coincides with the matrix M(uj,a,P) except that every 
entry in the ith row is differentiated with respect to a, i.e., 



Mi(w,a,/3) 



M 2 (u,a,P) 



M 3 (u,a,0) = 



UJ\(X UJ\ 

wia |wi + 3a 2 + ±cj 3 /3 2 
bj 2 P Lu 3 a/3 





aj 3 a;/3 
±w 2 + ±u; 3 a 2 + 3/3 2 



3 + \u x o? + ±w 2 /3 2 
u>i 

uj 2 (3 



LO\Ot 

6a 

w 3 a/3 



1^2 



u> 3 /3 
\-LO3a 2 



•3/3 2 



! 2 



2^^ 1 2 

Wl« 





W 2 



\uj x + 3a 2 + ±cj 3 /3 2 u; 3 a/3 
w 3 /3 w 3 a 



(24) 



(25) 



(26) 



Similarly, the notation Mij(u,a, (3) means the resulting matrix by differentiating, with re- 
spect to a, every entry of the ith and jth rows of M(co, a, f3), respectively; My^u, a, f3) means 
the matrix obtained by differentiating every entry of the ith, jth and kth rows of M(co,a, f3), 
respectively. All other matrices M^h ...(w, a, /3) are understood this way. In particular, the 
matrix such as Maj(oj, a, 0) with some identical indices means the matrix obtained by differ- 
entiating every entry of the ith row of M(oj, a, (3) twice and differentiating every entry of its 
jth row once. Notice that every entry of M(oj,a,(3) is at most quadratic in a. Thus if we 
differentiate a row three times or more, the resulting matrix contains a row with all entries 
zero, and hence its determinant is equal to zero. For example, det Mm 3 (a;, a, /3) = 0. 
Clearly, the second order partial derivative is given as follows: 

d 2 det M(u,a,P) 
d 2 a 

_ d det Mi (w, a, /3) <9det M 2 (w, a, j3) ddet M 3 (u, a, j3) 

da da da 

333 

= ^2detM lj (aj,a,/3) + ^ det M 2j (uj, a, fi) + ^ det M 3j (aj, a, f3) 
j=i j=i j=i 

3 3 

= EE detM ^( a; ' a '/ 3 ) 
i=ij=i 

= det Mn (w, a, /5) + det M 22 (w, a, /3) + det M 33 (u;, a, /?) 
+ 2detMi 2 (w,a,/3) + 2 det Mi 3 (a;, a, (3) + 2 det M 23 (w, a, (3), 



(27) 
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where the last equality follows from the fact Mij(co,a, f3) = Mji(u, a, (3). By differentiating 
(27) and noting that det Mm = det M222 = det M333 = 0, we have 

d 3 det M(u, a, P) ^ 3 3 

d s a = E E E det a, p) 

i=lj=lk=l 

= 3detM n2 (u3,a,P) + 3 det Mu 3 (uj, a, (3) + 3 det M 122 (u, a, (3) 
+ 6 det Mi23(w, a, P) + 3 det Mi 33 (w, a, p) + 3 det M 223 (w, a, /?) 
+ 3detM 233 (w,a,/3). (28) 

By differentiating it again and noting that det Mmj = 0, we have 

d 4 detM(u;,a,/?) J. 3 3 3 

^4- = 2^Z^EE detM ^(^,a,/5), 

i=i j=i fc=i ;=i 

= 6detMn 22 (w,a,/3) + 6detMn 33 (a;,a,/3) +6detM 2 2 33 (w,a,/5) 

+ 12detMn23(w,a,y3) + 12detMi 223 (w,a,/5) + 12 det Mi 233 (w, a, p). (29) 

Finally, 

^^■^ = ±±±±±^M mM «M (30) 

i = l j = l fc = l 1 = 1 p=l 

8Cdet y t r ,a, ' 3) = EEEEEE««*(«-«.fl. (3D 

i=l j=l jfc=l J=l p=l q=l 

Our first technical result is given as follows. 

Lemma 4.2. Let co 1: co 2 , co 3 G [2,4] anda,/3 G [-1,1]. T/ien the function gljetM^W) ^ 
convex with respect to a, and 9 detM^o^] > q 

Proof. Let /? G [—1, 1] and wi, w 3 , W2 G [2,4] be arbitrarily given. Define 5(a) = - det ^^' a '^) _ 
which is a function in a. As we mentioned earlier, if we differentiate a row of M(uj, a, P) three 
times or more, the determinant of the resulting matrix is equal to zero. Thus, the nonzero 
terms on the right-hand side of (31) are only those matrices obtained by differentiating every 
row of M(oj, a, (5) exactly twice, i.e, the terms 



det M ijk i pq (u, a, p) = det Mi 12233 (u, a, j3) = det 



wi 
6 
uj 3 



6wia; 3 > 0, 



and hence 9" (a) = ^detM^p) > Q Thig implieg th&t g ^ = g^detM^W) ig conyex ^ 
respect to a G [—1, 1]. Notice that g'(a) = 9 det M(v,<*,P) _ ^ nQw p rove that 

d 5 deb M(u, a, p) 



</(o) = 



d 5 



a 



= 0. 

a=0 
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When the matrix M is differentiated 5 times with respect to a, there are only three possible 
cases in which we have nonzero determinants for the resulting matrices. 

Case 1: Rows 1, 2 are differentiated twice, and row 3 once. In this case we have 



detMn223(w,a, 0) = det 



wi 
6 

uj 3 (3 oj 3 a 



6ujiuo 3 a. 



Case 2: Rows 1, 3 are differentiated twice, and row 2 once. We have 



det Mii233(w, P) = det 
Case 3: Rows 2, 3 are done twice, and row 1 once. Then 



wi 

uji 6a ui 3 j3 
uj 3 



6u>iuj 3 a. 



det Mi2233(w, a, /3) = det 



U)\OL WlO 

6 

uj 3 



= 6u>iuj 3 a. 



Clearly, 



g'(a) = mi det Mh 2 2 3 {uj, a, (3) + m 2 det Mu 233 (uj, a, f3) + m 3 det Mi 2 233(^, a, (3) 
= 6(mi + rri2 + m 3 )ujiuj 3 a, 

where mi,m2,m 3 are positive integers due to the duplication of the terms in (30), such as 
Mii223(w,a,P) = Mi2i23(^,a,/3) = M 2 2i3i(w, a, /3). Therefore, g'(0) = 0. By the convexity of 
g(a), the minimum value of g attains at a = 0. We now prove that this minimum value is 
nonnegative, and hence g(a) > 0. Indeed, it is easy to see that 





Ldia 


UJl 







UJl 








M1223 (oj, a, f3) = 





6 





, Mii2 3 (uj,a,f3) = 


UJl 


6a 


uj 3 (3 







uj 3 (3 


uo 3 a 







uj 3 /3 


u 3 a 



Mi2 33 (uj,a,/3) 



uj\a uji 
uji 6a ui 3 f3 
u 3 



Mn 33 (uj,a,l3) 



Mn 22 (uj,a,(3) = 



uji 

uiia \ui\ + 3q 2 + \uj 3 f3 2 uj 3 af3 

uj 3 

uji 

6 

uj 2 /3 uj 3 a/3 \ui 2 + \uj 3 a 2 + 3/3 2 
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M 2233 (u;,a,/3) 

Therefore, by (29) we have 

d 4 det M(u,a,p) 



3 + \ujia 2 + \ijJ2P 2 001a u 2 f3 
6 

w 3 



5(0) 



d 4 a 



a=0 



= 12 [det M 1123 (uj, a, /3)\ a=0 + det M 1223 (u;, a,/3)\ a=0 + det Mi 233 (w, a, /3)\ a=0 ] 
+ 6 [det M1122 (w, a, /3) I Q =o + det Mn 33 (w, a, /3) | a=0 + det M 2233 (a;, a, fi) | a=0 ] 
= 18 ( 6wi + w 2 w 3 - -wiwl ) /3 2 + 18 ( 6w 3 + wi^ 2 - -w 2 u; 3 ) . 



By Lemma 4.1, when (jJi,w 2 ,u; 3 G [2,4], we have 



6^1 + W 2 W 3 - -WlWg = X2(W1,W3, w 2 ) > 0, 



6w 3 + wia; 2 - -^iu; 3 



X 3 (W1,W 3 ,W 2 )(= X4(W1,W2,W3)) > 0. 



Therefore, (7(a) > g(0) > for any a G [—1,1]. Since /? G [—1,1], and wi,w 3 ,a; 2 G [2,4] are 
arbitrarily given points, the desired results follows. □ 

Lemma 4.3. Let wi,w 2 ,a; 3 G [2,4] and a, /3 G [—1,1]. T/ien 9 det M(u),a,/3) ^ g convex with 
respect to a, and a = is a minimizer of it, i.e. 



d 2 det M (w, # det M (w, a, /3) 



a 2 



a 



9 2 



a 



a=0 



(32) 



/or any a, f3, co in the above-mentioned intervals. 



Proof. The convexity of - Aet M(u,a,0) reS p ec t to a j s an immediate consequence of 



Lemma 4.2 since 



d 4 det M(u),a,[S) 



, the second order derivative of 



d 2 dctM(u,a,l3) 



We now prove (32). To this end, we first show that 

d 3 dot M(u,a,P) 



Q3 



a 



a=0 



0, 



is nonnegative. 



(33) 



which implies that a = is a minimizer of the second order partial derivative. In fact, by 
(28), to calculate the third order partial derivative with respect to a we need to calculate the 
determinant of Mijk(uj,a, j3). Clearly, 



M 123 (u,a,P) 



uj\ct oji 
w 3 /3 u 3 a 
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and thus det Mi 23 (u;, ft, /3)\ a =o = 0. Similarly, we have 
M 112 (oJ,a,(3) = 



M 113 (u,a,p) 
M 221 {u,a,p) 
M 223 (u,a,P) 



wi 

lui 6a uj 3 j3 

uj 2 j3 oo 3 a/3 \u 2 + \u 3 a 2 + 3/3 2 

wi 

oj\a + 3a 2 + ^lo 3 /3 2 uj 3 a/3 

bJ 3 a 

Wlft £Ji 

6 

w 2 /3 w 3 a/3 ±cj 2 + ^ft 2 + 3/3 2 

3 + i^ift 2 + U\OL co 2 /3 








6 

uj 3 (3 uj 3 a 



which imply that det Mn 2 (w, a, /3)| Q =o = det Mn 3 (co, ft, /3)| a =o = det M 22 i(u;, ft, /3)| Q =o 
detM223(w,ft,/?)| a =o = 0. Finally, we have 



M 331 (u,a,/3) 



M 332 (u;,a,f3) 



oj±a uj\ 

wift \oj\ + 3ft 2 + \u 3 fi 2 uj 3 af3 
" a; 3 

3 + ^cJia 2 + \0J2fi 2 oj\a ou 2 /3 

oj\ 6ft oj 3 f3 

w 3 



Clearly, we also have that det M 33 i(lo, a, (3)\ a= o = det M 332 (lo, a, (3)\ a= Q = 0. From the above 
calculation, by (28) we see that (33) holds. This means that ft = is the minimizer of - M g^'^ 
for any given lo\, u 2 ,uj 3 G [2,4] and j3 G [—1, 1], and thus (32) holds. □ 



Based on the above fact, we may further show that the second order partial derivative is 
positive in the underlying intervals. 

Lemma 4.4. If u x ,uj 3 ,uj 2 G [2,4] and a, € [-1,1], then & M jff£>P) > 0. 

Proof. By Lemma 4.3, ft = is a minimizer of 9 M g^^ . Thus, it is sufficient to show 
that at ft = the second order partial derivative is nonnegative for any given j5 G [—1,1] and 
<jJ\,<jJ3,<jJ2 G [2,4]. Indeed, 

d 2 det Mjuj, a, g) 
<9 2 ft 

= det M u (oj, ft, /?) + 2 [det M 12 (oj, a, j3) + det M 13 (co, a, j3) + det M 23 (u, ft, /?)] 
+ det M 22 {uj, ft, /3) + det M 33 (lo, a, (3) 
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det 



wi 
uia 1<jJ\ + 3a 2 + iw 3 /3 2 









w 2 /3 u 3 af3 


±W 2 + |w 3 Q 






LO\OL D\ 


U 






+ 2 det 


uj\ ba 


L0 3 fJ 










OJod UliCtB ■huio 4- 


1 2 

2 ■ i 


+ 3/3 2 

















+ 2 det 


ijJ\OL + "JO + 


^ 3 /3 2 


w 3 a/3 








w 3 /3 




w 3 a 








" 3 + \ujia 2 + ±u; 2 /3 2 




w 2 /3 




+ 2 det 




6a 


w 3 /3 











w 3 /3 


u; 3 a 








3 + iwiQ 2 + ±u; 2 /3 2 


wia 






+ det 





6 










w 2 /3 


w 3 a/3 


5W2 + \ 






3 + ±wia 2 + ±w 2 /3 2 








+ det 




iwi + 3a 2 + 



















w 2 /3 




w 2 /3 
w 3 a/3 
w 3 



and hence, by setting a = and by a simple calculation we have 
<9 2 det M(uj, a, j3) 3 3 9 



9 2 



= 9w 2 + -wiu;3 - -w 2 w 2 + - f 12 + wiw 2 w 3 - ui\ - - /3 2 

a=0 Z 4 Z V ' 

(3 3 \ 

9w 2 + 2 W i^3 - J /3 4 . 



By Lemma 4.1, for any wi,oj 2 ,u; 3 G [2,4], we have the following inequalities: 



90J 2 + -<jJ\OJz - -^1^2 

3 3 2 

9w 2 + -W1W3 - -w 2 (j 3 



2X4(^1,^3,^2) > 0, 
-Xi(wi,^3,w 2 ) > 0, 



12 + u>iu>2^3 — u\ — W3 — u; 2 = ip(uji, U>3, OJ2) > 0. 



Thus, the minimum value of - de ^^( a '^) j s nonnegative. □ 



We now prove the main result of this subsection. 



Theorem 4.5. If oj\,oj3,ou2 G [2,4], then M(uj,a, (3) ^ for any a, ft E [—1,1]. 

Proof. By Lemma 4.4, det M(ui, a, f$) is convex with respect to a since its second partial 
derivative is nonnegative. We now prove the a = is a minimizer of det M{u, a, /3) for an 
arbitrarily given (3 G [—1, 1] and in [2,4]. Indeed, from (24), (25) and (26), it is easy to see 
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that 

detMi(w,a,/3)| Q= o = det M 2 (w, a, /3)\ a =o = det M 3 (a;, a, f3)\ a=0 = 0, 
and hence by (23), we have 

ddetM(u,a,/3) 

0) 



da 



o=0 



which together with the convexity of det M(u, a, (3) implies that the minimum value of det M(w, a. 
attains at a = 0. Substituting a = into det M(cj, a, (3) we have 



det M(uj, a, (3) 



a=0 



= det 



3 + ±w 2 /3 2 








w 2 /3 



w 2 /3 ic; 2 + 3/3 2 

| (wi + w 3 /3 2 ) Qwa + (^3 - ^ /3 2 + i W2 /3 



(34) 



Notice that the quadratic function 9 — |t + ^gi < for any t G [4,36]. Since oj 2 G [2,4], we 
have that w| G [4, 16] C [4, 36] we conclude that 



3-^ 2 



4(-a; 2 ) (-wa) =9--^ + -^<0 



which means the determinant of the quadratic function \lo2 + ^3 — j<^ 2 ) t + \uJ2t 2 (in t) is 
non-positive, and thus 



which together with (34) implies that 

det M(u, a, (3) > det M(w, a, f3) 



a=0 



> 0. 



Thus, the third order principle minor of M{u, a, (3) is nonnegative. As we mentioned at the 
beginning of this subsection, under our conditions the diagonal entries of M are positive, and 
all second order principle minors of M are also nonnegative. Thus, M(oj, a, (3) y under our 
conditions. □ 



By symmetry, we also have the following result: 



Theorem 4.6. // oj\, 0J3, 0J2 € [2,4], then P(uj,a,f3) y and Q(oj,a,/3) h for any 
a, f3 £ [—1, 1], where 



' 3a 2 + \uj x + 












(jJ20i(3 








1 2 

^a 2 


+ 3 + 










0J2OLj3 






uj 3 (3 




1 2 


+ |W 3 + 


3/? 2 


' 3a 2 + \ujx0 1 


+ 1^2 




uj\a(3 










ujia/3 




\uj\ot 1 


+ 3/3 2 


+ 5^3 








UJ20t 






uj 3 (3 




1 2 
3W2cr 


+ ^ 3 /? 2 


+ 3 
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Proof. This result can be proved by the same way of Theorem 4.5. However, repeating 
the whole similar proof is too tedious. We may prove the result by symmetry. In fact, notice 
that all the analysis and results in this Subsection depend only on the following conditions: 
ui,u)2,uj3 e [2,4] and a, (3 € [—1, 1]. By permuting the rows 1 and 2, and columns 1 and 2 of 
P(uj,a,P), and by setting the substitutions w\ = u±, 0J2 = ^3 and W3 = O2, then P(cu,a,/3) 
is transformed to M(u,a, f3) where u> = (u>i, u>2, £3). Clearly, we have uij £ [2,4] (j = 1,2,3). 
Thus, the positive semi-definiteness of P immediately follows from that of M. Similarly, by 
swapping rows 2 and 3 and swapping columns 2 and 3 of Q , and setting the substitutions 
uj\ = UJ2, 0J2 = and U3 = 0J3, it is easy to see that Q(w,a,/3) can be transformed into 
P(£j,a,(3). Therefore, the result follows from Theorem 4.5 immediately. □ 

4.2 The improved sufficient convexity condition 

For general Kantorovich functions, Theorem 2.3 claims that when K{x) is convex, it must 
satisfy that k(A) < 3 + 2\/2 which is the necessary condition for K{x) to be convex. This 
necessary condition is equivalent to 2 < Ay < 6 for any i / j. Thus any sufficient condition for 

the convexity of K(x) must fall into this range. Theorem 2.2 shows that if k,(A) < \Jf> + 2\/6 ~ 

3.14626 (which is equivalent to 2 < A^ < « 3.4641 for any i^j), then K(x) is convex. 

Based on the result of Subsection 4.1, we now prove that this sufficient convexity condition 

can be improved in 3-dimensional space. 

Notice that in 3-dimensional space, by (8) and (9), the positive semi-definiteness of the 
Hessian matrix of K(x) is equivalent to that of 

H 3 (y) 

' Zy\ + + ±Ai 32 /§ Aisyiya A 13 y m 

= A212/22/1 |A 2 i2/i +3yl + 5 A 23yI A 2 3y 2 y3 

A 3iy 3 yi A322/32/2 2" A 3i^i + i A 32yi + 3y| 

(35) 

The main result of this section is given below. 

Theorem 4.7. If k(A) < 2 + y/S, then H%(y) y for any y G R n , and hence the function 
K(x) = x T Axx T A~ l x is convex. 

Proof. Let Hs(y) be given by (35). Clearly, #3(0) >z 0. In what follows we assume that 

U = {yi-,V2-,yz) / 0. There are three possible cases: 

Case 1. y\ is the component with the largest absolute value: \y\\ > max{|?/2|, |j/3|}- Denote 
by a = ^, (3 = ^. Notice that A y - = Aji for any i ^ j (see (6)). By setting uj = A = 
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(A12, A 13, A23), it is easy to see from (35) that 
3+iA 12 a 2 + iA 13 /3 2 



H 3 (y) 



2 A 12 a 2 + iAi 3 /3 2 A 12 a A 13 f3 

A 21 a iA 2i + 3a 2 + ±A 23 /3 2 A 23 a/3 

±A 31 + ±A 32 a 2 + 3/3 2 



A 32 a(3 



y 2 M(A,a,/3) 



where M(-, v ) is defined by (22). Thus if M(A,a,8) h for any a, 8 € [-1,1], then 

Hs(y) <z for any y such that |yi| > max{|y2|, |?/3|}- 

Case 2. 1/2 is the component with the largest absolute value: I2/2I > rnax{|yi|, \y%\. Denote 



by a = 8 = ^. Then, we have 



H 3 (y) = vl 



3a 2 



±A 12 + ±A 13 /3 2 



A 2i a 
A 31 a/3 



A i2 a 

±A 21 a 2 + 3 + ±A 23 /3 2 
A 32 /3 



A 13 a(3 
A 23 /3 

±A 3ia 2 + ±A 32 +3/3 2 



y 2 2 P{A,a,p) 



where P(-,-,-) is defined as in Theorem 4.6. Thus if P(A,a,8) y for any a, 8 € [—1,1], 

then Hz(y) y for any y such that 1 2/2 1 > m ax{|yi|, 1 2/3 1 } - 

Case 3. 2/3 is the component with the largest absolute value: \y%\ > max{|yi|, 1 2/2 1 } - Denote 



by a = »i, /3 = *a. Then, 



^3(1/) = 2/3 



3a 2 + ±A 12 /3 2 + ±A 



2 ^i3 A 12 a/3 A 13 a 

A 21 a/3 " ±A 21 a 2 + 3/3 2 + ±A 23 A 23 /3 
A 3i a A 32 /3 iA 3 ia 2 + ±A 32 /3 2 + 3 



= y 2 Q(A,a,/3) 

where Q(-, -, •) is defined as in Theorem 4.6. Therefore, if Q(A, a, 8) y for any a, 8 e [—1, 1], 
then H%(y) y for any y such that 1 2/3 1 > max{|yi|, 1 2/2 1 3" - 

If k(A) < 2 + y/3 which is equivalent to Ay G [2,4] where i, j = 1,2,3 and i 7^ j, by setting 
cj = A, i.e., (cji, 0^2,^3) = (A12, A13, A23), from Theorems 4.5 and 4.6 we have that 

M(A,a,B) t 0, P(A,a,B) y 0, Q(A,a,B) h for any a, 8 € [-1,1]. 

Thus, H^iy) y for any y G i? n , and hence -ftT(x) is convex. □ 

The above theorem shows that the upper bound of the condition number in Theorem 2.2 
is improved to 2 + y/S (~ 3.73205) in 3-dimensional space. As a result, Corollary 2.4 can be 
improved accordingly in 3-dimensional space. 



Corollary 4.8. Let n = 3. Assume that there exists a constant 7* such that the following 
statement is true: k(A) < 7* if and only if K(x) is convex. Then the constant 7* must satisfy 
that 

2 + \/3<7*<3 + 2V2. 
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5 Conclusions 

The purpose of this paper is to characterize the convexity of the Kantorovich function through 
only the condition number of its matrix. We have shown that if the Kantorovich function 
is convex, the condition number of its matrix must be less than or equal to 3 + It 
turns out that such a necessary condition is also sufficient for any Kantorovich functions in 
2-dimensional space. For higher dimensional cases (n > 3), we point out that a sufficient 
condition for K(x) to be convex is that the condition number is less than or equal to \J 5 + 2\/6. 
Via certain semi-infinite linear matrix inequalities, we have proved that this general sufficient 
convexity condition can be improved to 2 + a/3 in 3-dimensional space. Our analysis shows 
that the convexity issue of the Kantorovich function is closely related to a class of semi- 
infinite linear matrix inequality problems, and it is also related to some robust feasibility /semi- 
definiteness problems for certain matrices. Some interesting and challenging questions on 
K{x) are worthwhile for the future work. For instance, can we obtain a complete convexity 
characterization for n > 3? In another word, for n > 3, does there exist a constant 7* such 
that k(A) < 7* if and only if K(x) is convex? If the answer to these questions is 'y es \ what 
would be the explicit value or formula (which might depend on n) for this constant? 
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